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O i 1 Abstract 

■ In recent work of Hazan and Krauthgamer (SICOMP 2011), it was shown that finding an e- 

, approximate Nash equilibrium with near-optimal value in a two-player game is as hard as finding 

a hidden clique of size 0(log n) in the random graph G(n, ^). This raises the question of whether 
a similar intractability holds for approximate Nash equilibrium without such constraints. We 
give evidence that the constraint of near-optimal value makes the problem distinctly harder: a 
simple algorithm finds an optimal ^-approximate equilibrium, while finding strictly better than 
^-approximate equilibria is as hard as the Hidden Clique problem. This is in contrast to the 
unconstrained problem where more sophisticated algorithms, achieving better approximations, 
are known. 

Unlike general Nash equilibrium, which is in PPAD, optimal (maximum value) Nash equilib- 
rium is NP-hard. We proceed to show that optimal Nash equilibrium is just one of several known 
NP-hard problems related to Nash equilibrium, all of which have approximate variants which 
are as hard as finding a planted clique. In particular, we show this for approximate variants of 
f*** ' the following problems: finding a Nash equilibrium with value greater than 77 (for any r\ > 0, 

even when the best Nash equilibrium has value 1 — 77), finding a second Nash equilibrium, and 
\ finding a Nash equilibrium with small support. 

Finally, we consider the complexity of approximate pure Bayes Nash equilibria in two-player 
games. Here we show that for general Bayesian games the problem is NP-hard. For the special 
case where the distribution over types is uniform, we give a quasi-polynomial time algorithm 
matched by a hardness result based on the Hidden Clique problem. 

X 

c5 '. 1 Introduction 

The classical notion of Nash equilibrium is the most fundamental concept in the theory of non- 
cooperative games. In recent years, there has been much work on the complexity of finding a Nash 
equilibrium in a given game. In particular, a series of hardness results culminated in the work 
of Chen et. al |CDT09j . who showed that even for two-player (bimatrix) games, the problem of 
computing a Nash equilibrium is PPAD-complete, and thus unlikely to be solvable in polynomial 
time. 

Therefore, it makes sense to consider the complexity of approximate equilibria. In particular, a 
notion which has emerged as the focus of several works is that of an e- approximate Nash equilibrium, 
or e-equilibrium for short, where neither player can gain more than e (additively) by defecting to 
a different strategy (without loss of generality, all payoffs are scaled to lie in the interval [0, 1]). 
A straightforward sampling argument of Lipton et al. [LMM03J shows that in every game, there 
exist e-equilibria with support 0(logn/e 2 ), and so they can be found in quasi-polynomial time 
n O(\ogn/e ) k v exhaustive search. 
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On the other hand, finding good polynomial time approximations has proved more challenging. 
While finding a ^-equilibrium turns out to be quite simple (DMP09 |. more complicated algorithms 
have given a series of improvements [DM P071 IBBMIO} ITS08] . where the current best known is the 
0.3393-equilibrium shown by Tsaknakis and Spirakis [TS08]. A major open question in this area is 
whether or not there exists a PTAS for Nash Equilibrium (note that the algorithm of Lipton et al. 
gives a quasi-polynomial time approximation scheme for the problem). 

Recently, Hazan and Krauthgamer [HK11] have attempted to provide evidence for the optimality 
of the QPTAS of Lipton et al. |LMM03] . by showing a reduction from a well-studied and seemingly 
intractable problem (which can also be solved in quasi-polynomial time) to the related problem of 
finding an e-equilibrium with near maximum value (the value of an equilibrium is the average of 
the payoffs of the two players) . 

The problem they reduce from is the Hidden Clique Problem: Given a graph sampled from 
G(n, 5) with a planted (but hidden) clique of size k, find the planted clique. Since with high 
probability the maximum clique in G(n, i) is of size (2 — o(l))logn, it is easy to see that for 
constant 5 > 0, one can distinguish between G(n, |) and G(n, |) with a planted clique of size 
k > (2 + 5) log n in quasi-polynomial time by exhaustive search over all subsets of (2 + 5) log n 
vertices. It is also not hard to extend this to an algorithm which finds the hidden clique in quasi- 
polynomial time. 

On the other hand, the best known polynomial time algorithm, due to Alon et al. [AKS98J 
only finds cliques of size £l(y/n). In fact, Feige and Krauthgamer [FK03] show that even extending 
this approach by using the Lovasz-Schrijver SDP hierarchy, one still requires O(logn) levels of 
the hierarchy (corresponding to n^( logn ) running time to solve the SDP) just to find a hidden 
clique of size n 1 / 2-2 . The only possible approach we are aware of for breaking the £l(yjn) barrier 
would still (assuming certain conjectures) only discover cliques of size f2(n c ) for some constant 
c>0 |FK08llBV09] . 

Hazan and Krauthgamer show that finding a near-optimal e-equilibrium is as hard as finding 
hidden cliques of size C log n, for some universal constant C. Here, by near-optimal we mean having 
value close to maximum possible value obtained in an actual Nash equilibrium. Subsequently, 
Minder and Vilenchik [MV09] then improved this hardness to planted cliques of size (2 + 5) log n 
for arbitrarily small 6 > 00 Here, we will rely on the hardness assumption for hidden cliques of 
size Clogn for any constant C, and will not concern ourselves with optimizing the value of C. 

1.1 A Sharp Result For Near-Optimal Approximate Nash 

It is important to note that the problem considered in [HK11] is not equivalent to finding an 
unconstrained e-equilibrium. In light of the results of [HK111 lMV09j it is natural to ask to what 
extent the hardness for near-optimal approximate equlibrium gives an indication of hardness for 
unconstrained approximate equilibrium. Indeed, UK 1 II. in their concluding remarks, ask whether 
their methods can be used to rule out a PTAS for unconstrained Nash equilibrium. One of the 
messages of this paper is that these two problems are quite different in terms of approximability and 
that one should not yet be overly pessimistic about the possibility for a PTAS for unconstrained 
Nash equilibrium. Indeed, while there is a a polynomial time algorithm to find a 0.3393-equilibrium, 
we show that finding a near-optimal (i — r/)-equilibrium is hard. 

Theorem 1.1 (Informal). For every 77 > 0, finding a near-optimal (^ —rf)- approximate equilibrium 
is as hard as finding a hidden clique of size Clogn in G(n, ^). 

1 There is a small caveat: the reduction of MV09 only certifies the presence of a hidden clique (i.e. distinguishes 
the graph from the random graph G(n, i) w.h.p.) but does not identify the vertices of the clique. 
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As mentioned above, there is a simple polynomial time algorithm to find a ^-equilibrium, and 
we show that this algorithm can be extended to find a ^-equilibrium with value at least that of the 
best exact equilibrium: 

Theorem 1.2 (Informal). There exists a polynomial time algorithm to find a ^-approximate equi- 
librium with value at least that of the optimal true equilibrium. 

Thus, Theorem 11.11 is tight and unlike unconstrained Nash equilibrium, where stronger tech- 
niques yield approximations better than ^, near-optimal Nash equilibrium does not admit efficient 
"non-trivial" approximations (assuming the Hidden Clique problem is hard). 

1.2 The Bigger Picture: Hardness for NP-hard Variants of Nash 

Just like with unconstrained e-equilibrium, finding a near-optimal e-equilibrium can be done in 
quasi-polynomial time using the algorithm of [LM M03] . However, the exact version - finding a 
maximum value Nash equilibrium - is known to be NP-hard [GZ89J and therefore harder than 
its unconstrained counterpart which is in PPAD |Pap94|. In fact, maximum value Nash is one of 
several optimization variants of Nash equilibrium which are NP-complete. Other variants include: 
determining whether a bimatrix game has more than one Nash equilibrium [GZ89], finding a Nash 
Equilibrium with minimum support [GZ89J, and determining whether there exists an equilibrium 
with value at least 1 — i or all equilibria have value at most e/n (even for arbitrarily small e = 
e(n) > 0) [CS03]. We show that approximate-equilibrium variants of these problems are also as 
hard as Hidden Clique. 

For the problem of obtaining any non-trivial approximation to the optimal value of a Nash 
equilibrium, we prove the following theorem. 

Theorem 1.3 (Informal). For every r] > 0, finding an e-equilibrium with value at least rj is as hard 
as finding a hidden clique of size Clogn in G(n, §), even in a game having an equilibrium of value 

1 — T). 

For the case of determining whether a game has more than one equilibrium, note that by 
continuity considerations, every two-player game has an infinite number of e-equilibria. Thus, the 
appropriate approximate analog is to consider the problem of finding two e-equilibria with (at 
least) a certain total variation distance between them. We show that this problem is also as hard 
as Hidden Clique. 

Theorem 1.4 (Informal). For all sufficiently small e > 0, determining whether a game has two 
different e-approximate equilibria (say, having statistical distance at least 3e) is as hard as finding 
a hidden clique of size Clogn in G(n, |). 

We then move to the problem of finding an equilibrium with small support. Recall that 
by [LMM03], there exist e-Nash equilibria with support 0(logn/e 2 ). It is also known that for 
any 77 > 0, in certain two-player games all (i — ^-equilibria must have support at least logra/(l + 
log(l/r ? )) [FNS07] (the threshold of \ is tight, since the simple ^-equilibrium of [DMP09J has sup- 
port 3). As an approximate-equilibrium variant of the Minimum Support Equilibrium problem, we 
consider the problem of finding an e-equilibrium with support at most some threshold t, and prove 
the following hardness result. 

Theorem 1.5 (Informal). For every r) > ; finding a (^ — 77) -equilibrium with support size Clogn 
is as hard as finding a hidden clique of size Clogn in G(n, ^). 
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This can be seen as a complexity-theoretic analogue of the lower bound of [FNS07| mentioned 
above. Again, this contrasts with the situation for unconstrained Nash equilibrium, which is guar- 
anteed to exist, and admits stronger approximations. 

While these are all negative results, we again would like to stress that there is a positive message 
to this story: these problems are hard because they are approximate versions of NP-complete 
problems, not because they are approximate variants of Nash equilibrium. Therefore, these results 
should not be viewed as indications that Nash equilibrium does not have a PTAS. 

1.3 The Complexity of Approximate Pure Bayes Nash Equilibria 

Finally, we consider the problem of approximating pure Bayes Nash Equilibria (BNE) in two- 
player games. Bayesian games model the situation where the players' knowledge of the world is 
incomplete. In a Bayesian game, both players may be in one of a number of different states, known 
as types, representing what each player knows about the state of the world, and the payoff of each 
player depends on the type of both players in addition to their strategies. The types are distributed 
according to some joint distribution and are not necessarily independent. A pure strategy for a 
Bayesian game assigns to each type a strategy that the player plays when she is in that type. In a 
pure BNE, conditioning on any given type for a given player, the player cannot gain by changing 
his strategy for that type. See Section [6] for precise definitions. 

Conitzer and Sandholm [CS03J have shown that determining whether a given two-player game 
has a pure BNE is NP-complete. We show that this holds also for approximate pure BNE. 

Theorem 1.6 (Informal). Let e = 0.004. Then given a Bayesian game that admits a pure BNE, 
it is NP-hard to find a pure e-BNE for the game. 

However, this hardness result relies heavily on the joint distribution of the players' types being 
non- uniform (in fact, not even product distribution). We show that when the distribution over 
type pairs is uniform, there is in fact a quasi-polynomial time algorithm for e-approximate pure 
BNE (when a pure BNE exists). 

Theorem 1.7 (Informal). For every e > there is a quasipolynomial time algorithm to find a pure 
e-BNE in two-player Bayesian games with uniformly distributed types and in which a pure BNE 
exists. 

We remark that this algorithm extends easily to arbitrary product distributions over types but 
in order to keep the notation simple we restrict our attention to the uniform case. 

The algorithm is tight: it follows immediately from our hardness for Small Support Equilibrium 
that this problem is also as hard as Hidden Clique. 

Theorem 1.8 (Informal). For every n > 0, finding a (j — rf)- approximate pure BNE in a two- 
player Bayesian games with uniformly distributed types and in which a pure BNE exists is as hard 
as finding a hidden clique of size Clogn. 

1.4 Organization 

In Section Owe prove the results relating to approximate equilibria with good value: Theorem ll.il 
(Section [3J2]) , Theorem O (Section E3D , and Theorem O (Section E3]) . 

In Section |4] we prove Theorem 11.41 by a black-box application of Theorem 11.31 In Section [5] we 
prove Theorem 11.51 using similar techniques as for the hardness results of Section [3l In Section [6] 
we prove our results for Bayesian games, Theorems 11.61 IT7r| and 11.81 
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2 Preliminaries 



A bimatrix game Q = (M row , M co \) is a game defined by two finite matrices, M row and M co i, and two 
players: the row player and the column player. We assume throughout that the game is normalized 
so that both matrices have values in the interval [0, 1]. The row and column players choose strategies 
x and y respectively, where x,y are nonnegative vectors satisfying ^ X{ = YljUj = 1- A pure 
strategy is one with support 1 (i.e. a vector with 1 in one entry and in the rest). The row (resp. 
column) player's payoff is given by x T M vow y (resp. x T M co \y). 

A Nash equilibrium is a pair of strategies (x,y) such that neither player has any incentive 
to deviate to a different strategy, assuming the other player does not deviate. Formally, in an 
equilibrium, for all i, j we have ej M Iow y < x T M row y and x T M co \ej < x T M co \y. An e-approximate 
Nash equilibrium, or e-equilibrium for short, is a pair of strategies x, y where each player has 
incentive at most e to deviate. That is, for all i,j, 

ejM Iow y < x T M row y + e and x T M col ej < x T M col y + e. 



The value of a pair of strategies, denoted vg(x,y), is the average payoff of the two players, i.e., 

Vg(x, 



h3 



For a vector x S M n and S C [n], we write xs for the projection of x to the coordinates S. We 
write ||x|| = Ya=i \ x i\ f° r the norm of x. Thus, for a strategy (in other words, a probability 
distribution) x 6 [0, l] n we write for the probability that the player plays an element of S. 

Further, for a set S C [n] of strategies, we use vg\s(x, y) to denote the value of (x, y) conditioned 
on both players playing in S. Formally, 



vg\s{x,y) = . E 



M iaw {i,j) + M coi (i,j) 



i,j € 5 



vg{x s ,ys) 
\\ x s\\ ■ \\ys\ 



(If llx^H = or [|yg|| = 0, Vgig(x, y) is undefined.) 

Given an undirected graph G = (V,E), and (not necessarily disjoint) vertex sets Si,S% C V, 
we will denote by E(Si, S2) the set of ordered pairs {(i,j) € Si x S* 2 I 0\ J} £ E or i = j}. We will 
refer to d(S\, S2) = \E(Si, 5 , 2)|/(|S'i||52|) as the density of the pair (Si, S2). 

Finally, we shall make repeated use of the following standard Chernoff bound. 

Lemma 2.1 (Chernoff bound). Let Xi, . . . , X m be i.i.d. {0,1} random variables and let = 
iEEX;]. Then 



Pr 



< e - 2e2m . 



3 Approximate Equilibria With Good Value 
3.1 The Reduction 

In this section we describe the general reduction that we use to prove Theorems 11.11 and 11.31 and 
describe its properties. This reduction also forms the basis for the reductions we use to prove 
Theorems 11.41 11-51 and 11.81 It is based on the reduction of [HK11] . 

As in [HKllJ our soundness analysis proceeds by using the approximate equilibrium to find a 
dense bipartite subgraph of G. The following lemma shows that this is sufficient to recover the 
hidden clique. 
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Lemma 3.1 ([H K1H Lemma 5.3]). There exist universal constants c\ and c% such that the following 
holds. Let G be a sample from G(n, \) with a hidden clique of size Clogra for some C > c\. Then, 
given a pair of vertex sets Si, £2 Q N of size C2logn and density d(Si,S2) > 5/9 we can in 
polynomial time reconstruct the hidden clique (with high probability over G). 



The lemma is slightly different from Lemma 5.3 of [H K11] : there we start with a bipartite 
subgraph of density 3/5 instead of 5/9 but this minor difference only changes the value of the 
constant C2 - the lemma holds for any constant density strictly larger than ^. 

Let us now describe the reduction. It is controlled by three parameters a, f3, 7 G (0, 1). Setting 
these parameters appropriately gives the various hardness results. 



Reduction 3.2. Let G = (V,E) be an n vertex graph and A its adjacency 
matrix (with Is on the diagonal). Then, for parameters a, /3, 7 G (0,1), we 
define a (random) bimatrix game Q := Q(G, a, /3, 7) as follows. 
Let N = n c where c = (C2+I) log 1 //? for the universal constant C2 of Lemma [3.11 
Pick a random N x n matrix B whose entries are i.i.d. {0, 1} variables with 
expectation /3. Then Q = (M row ,M co i), where the payoff matrices are: 



M r , 



aA 
B 







M, 



col 



aA B 1 
7J 



(1) 



where J is the all-ones N x N matrix. 



We conclude this section with an additional lemma which shows how to obtain a dense bipartite 
subgraph given an approximate equilibrium of Q with certain properties. This lemma (and its proof) 
is analogous to [HKlll Lemma 5.2], but as we need it in larger generality we also give the proof. 



Lemma 3.3. Let Q be as in Reduction \S. 6 A Fix any s G [0,1], t G [0,1] and e G [0,1] such that 
1 — t — 2>y/s/2 > a + e, and let (x,y) be an e-approximate equilibrium of Q with the following two 
properties: 

• Both ||x[ ra ] || > 1 — t and \\y\ n ] \\ > 1 — t. 

• The conditional value vgu n ^(x,y) > (1 



s a. 



Then, given (x,y) as above, we can efficiently find vertex sets S±,S2 C [n] each of size C2logn and 
density d(S\,S2) > 5/9. 



In the proof of Lemma [3. 3 1 we shall use the following simple claim, which is analogous to [HKlll 
Claim 5.3]: 

Claim 3.4. Let S C [n] be a set of size \S\ < C2logn. Then, w.h.p. over Q, in any e-equilibrium 
in the above game, the probability mass a column (or row) player may place on such a set S is at 
most e + a. 



The proof of the claim follows by noting that for such S, with high probability, there is a row 
i of B such that B 
exactly 

(l - f3~ lSl )" < exp(-n- C2logl//3 iV) = exp(-n logl//3 ). 
We omit the remaining details as they are identical to [HK11, Claim 5.3]. 



l;J — 1 for all j G S. Indeed, the probability that such a row does not exist is 

TV 

-n 
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Proof of Lemma Wlh Let x, y G [0, l] n be the strategies x and y conditioned on playing in [n]. That 
is, x = X[ ra ]/||x[ n ] ||, and y = / 1 1 2/[n] II- The second given property of the strategy pair (x,y) can 
be rephrased as x T Ay > 1 — s. 
Let 

S[ = ji e [n] : e, T ^4y > 1 - 2^/3} . 

To obtain a lower bound on the cardinality \S[\, we shall bound \\xgi || by e + a and then appeal to 
the claim. By the first given property of (x,y), we have 

Wxs'J >l-t - 1| > 1 - t - P M \5./ 1|. 

We can bound the last term, ||5[ n ]\5j||, from above using Markov's inequality, viz., 



•[n]\Sjll = Pr _ 

1 i~x L 



I - ej Ay > 2^/3 <-^- = 3^/2. 

J 2^s/3 



Thus we have ||a;g/ 1| > 1 — t — 3y/s/2 > e + a and so by the claim, |S^| > C2 log n. Truncate by 
taking any arbitrary subset S\ C of cardinality = C2logn. 

Now let x G [0, l] n be the uniform distribution over Si. Note that x T Ay > 1 — 2y/s/3. Let 

S' 2 = |j G N : x T .Aej > 5/9} . 

The argument to lower bound \S' 2 \ is similar to the argument for \S[\. We get 



„V[n]\S' 2 \\ = Pr - 

and therefore 



1 - x T Aej > 4/9] < = 3v^/2. 



||^ || > 1 - t - 3^/2 >e + a, 

as desired. Again, we can truncate S' 2 by taking a subset S2 C of cardinality (S^l = C2 logn. By 
construction d(Si,S2) > 5/9, and we are done. □ 

3.2 Hardness for e close to | 

To obtain Theorem 11.11 the main requirement is to set a = 2 + 0(rj). The values of /3 and 7 are 
essentially irrelevant in this case - the only thing needed is that /3 is bounded away from both 
and a and that 7 < \. 

Lemma 3.5. Let a = \ -\- 1, 7 < ^ and Q be the game of Reduction \ 3.°A Then for any pair of 
strategies (x,y) with value at least vg(x,y) > a — t 2 it holds that ||x[ n ]|| and \\y\ n ]\\ are both at least 
l-t. 

Proof. Let p = ||x[ n ]|| and q = \\y[ n ]\\- As the value of any outcome outside the aA block is at most 
|, we have that the value of (x,y) is at most 

. 1 1 
pqa + {1 - pq)- = tpq + -, 

so that if the value is at least a — t 2 we have 

1 2 

tPQ + 2 — a ~ t 

a-t 2 -1/2 
pq > — = l-t 

Since p,q G [0, 1], it follows that they are both at least 1 — t. □ 
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Observation 3.6. Let (x,y) be any pair of strategies with value vg(x,y) > | and ||xr n ]|| > 0, 
\\y\ n ]\\ > 0. Then vgu n i(x,y) > vg(x,y), provided that 7 < i. 

Plugging this into Lemma 13.31 we can now easily complete the proof of hardness for e close to 

1 

2' 

Theorem 3.7 (Detailed Statement of Theorem II. ip . For every r\ > there exist 5 = Q(r] 2 ), 
a > \ and universal constant C not depending on r\ such that the following holds. Given a graph 
G = (V, E) we can in randomized polynomial time construct a bimatrix game Q with maximum 
value a (over all strategy pairs) such that, if G = G(n, |) with a hidden clique of size Clogn, the 
following holds (w.h.p. over G andQ): 

Completeness There is a Nash equilibrium (x,y) with value a. 

Soundness Given any (i — if) -equilibrium with value >a — 5, we can efficiently recover the hidden 
clique. 

Proof. Given a graph G = (V,E), we apply Reduction 13.21 with parameters as follows. For some 
t > to be determined momentarily, let a = ^ + 1, (3 = 7 = 1/3, 5 = t 2 . 

For the completeness, we shall show that having both players play uniformly over the hidden 
clique is an equilibrium. For this to hold, we have to make sure that there is no row in B with 
average value at least a in the positions corresponding to the clique. By Lemma 12.11 and a union 
bound over all rows of B we can bound the probability of this happening by 

Ne -2(a-/3) 2 Clogn < jy . ra -C/18 

If C is a sufficiently large universal constant (e.g., C = 18 • (02 + 1) log 1//3 + 1) this probability is 
o(l) and the completeness property follows. 

For the soundness, consider any (^ — ^-approximate equilibrium (x,y) with value at least 
vg(x,y) > a — 5 = a — t 2 . By Lemma 1331 both ||xr n ]|| and ||y[ n ]|| are at least 1 — t. Furthermore, 
by Observation l3~6l we have (x,y) > a- 5 > oj(1 - t 2 ). 

Now if \ — r\ < l — t — 3t/2 — a= \ ~ 7i/4 we can apply Lemma [3 .31 and extract a dense bipartite 
subgraph of G which can be plugged in to Lemma [3. II to obtain the hidden clique. Setting t = 4r//7 
we get the result. □ 

3.3 Distinguishing Between Low and High Value 

For Theorem II. 31 the choices of all three parameters a, /3, 7 of Reduction 13.21 are important. We are 
going to set 7 close to 0, and a > f3 both close to 1. 

On a high level, the proof has the same structure as that of Theorem 11.11 However, in the 
current setting Lemma 13.51 and Observation 13.61 do not apply. To arrive at similar conclusions we 
use a different argument, exploiting the fact that (x, y) is a e-equilibrium. Essentially, the argument 
is as follows: the off-diagonal blocks (B and B T ) are not stable, since there is too much incentive 
for at least one player to deviate. Therefore, most of the probability mass in an equilibrium is 
concentrated either in the aA block, or in the 7J block. However, in the 7J block, the value is 
too small. So, if the equilibrium has even slightly larger value, its mass must be concentrated in 
the aA block. There it has to actually have very large value, since otherwise, there is incentive for 
both players to deviate to B and B T to get reward /3. The rest of the proof follows as before. 

Formally, we have the following lemma, showing that (under certain conditions) any e-equilibrium 
with non-negligible value must satisfy the conditions of Lemma 13.31 
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Lemma 3.8. Fix a parameter £ G (0,1), let a — (3 < e, and 7 = 4y/e and consider the game Q as 
in Reduction 1 3. B 

Then, w.h.p. over Q , any e-equilibrium (x,y) with value more than 5y/e satisfies: 

• Both ||cc[ n ]|| and \\y\ n i\\ are at least 1 — yfe. 

• v g \ [n] (x,y) >a-3e. 

Proof. Consider any e-equilibrium (x, y) with value more than byfe. Note that this trivially implies 
that e < 1/25. 

Let p = \\x\ n ]\\ and q = \\y\ n ]\\ be the probability mass that the row (resp. column) player assigns 
to the first n strategies in this equilibrium. We begin with the first item, i.e., the lower bound on 
p and q. 

Consider the row player's incentive to deviate by shifting the probability mass in the first n 
rows to the uniform distribution over the remaining rows. When the column player is playing 
outside the first n columns, this deviation changes the row player's payoff from to 7, and when 
the column player is playing in one of the first n columns the row player's payoff decreases by at 
most a — f3 + o(l). Let us ignore this o(l) term. Since this is an e-equilibrium, we have 

p(l-qh-pq(a-/3)<e, (2) 

and so 

p(l-«) <-(e+pq(a-P)) 

7 

<-(e + a-/3) 
7 

<^~e/2 

Considering also the symmetric argument for the column player, this gives 

max{p(l — p),q(l — q)} < max{p(l — q),q(l — p)} < Ve/2, (3) 

where the first inequality holds trivially for all p,q E [0, 1]. The inequality p(l—p) < \fej1 together 
with the constraint e < 1/25 implies that either p < \fe or p > 1 — yfe. The inequality for q gives 
an analogous bound. Moreover, it cannot be the case that p<^<qoiq<^<p, since then we 
would have max{p(l — q), q(l — p)} > 7, contradicting ([3]). Thus, it follows that either p,q < yfe 
or p, q > 1 — yfe. 

Let us now exclude the first option. Suppose for contradiction that p,q < yfe. Then the value 
of the equilibrium is at most 

pqa + (p(l -q) + q(l - p)) ■ - + (1 - p)(l - 5)7 < ea + yfe/2 + 7 < ea + A.hfe < 5y/e, 

contradicting the assumption that (x, y) has value more than 5y/e. Hence both p and q are at least 
1- yf^. 

It remains to give a lower bound on the conditional value w := vgu n -\(x, y) obtained when playing 
inside aA. As above, consider the incentive for the row player to deviate to the uniform distribution 
over. To be more precise, we can use w instead of a in the bound ([2]). Solving for w gives 

w>f3 + p{1 - qh - £ >P-^>P-2e>a-3e, 
pq pq 

where we used that — < 1 „ 2 < ?| < 2 for e < 1/25. □ 
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Equipped with Lemma 13.81 it is easy to finish the proof of Theorem 11.31 

Theorem 3.9 (Detailed statement of Theorem II .3p . For every constant n > there exists = ^l(i] 2 ) 
and C = 0(l/ry 3 ) such that the following holds. Given a graph G, we can in randomized polynomial 
time construct a bimatrix game Q such that, if G = G(n, |) with a hidden clique of size Clogn, 
the following holds (w.h.p. over G and Q ): 

Completeness There is a Nash equilibrium (x, y) with both players earning payoff 1 — rj. 
Soundness Given any e-equilibrium with value > n, we can efficiently recover the hidden clique. 

Proof. Given a graph G = (V,E), we apply Reduction 13. 21 with parameters as follows. 

Let e = (rj/5) 2 , a = 1 — n = 1 — j3 = a — e, and 7 = Assume without loss of generality 
that n is small enough so that a > 3/4. 

For the completeness, we proceed as in the proof of Theorem 13. 71 We can upper bound the 
probability that the uniform distribution over the hidden clique is not an equilibrium by 

Ne -2(a-/3) 2 Clogn < ^-^(CV) _ 

We have N = n O( lo gV/3) = n O{v) = n O{V^) . Letting C be a sufficiently large multiple of 1/e 1 ' 5 the 
completeness property follows. 

For the soundness analysis, take any e-approximate equilibrium (x,y) with value at least 5y/e. 
By Lemma 13.81 w.h.p. (x,y) satisfies the conditions of Lemma 13.31 with t = y/e and s = — < 4e. 
The only remaining thing to check is the condition a + e < 1 — t — 2>y/s/2, which is easily verified 
(1 — Ay/e is an upper bound for the LHS and a lower bound for the RHS). 

□ 

3.4 An Algorithm For Good ^-Approximate Equilibria 

In this section we prove Theorem 11.21 by describing a simple algorithm to find a ^-approximate 
Nash equilibrium with at least as good value as the best exact Nash equilibrium. This shows that 
the bound on e in Theorem II. II is tight. 

For general ^-approximate equilibria (without any constraint on the value) , the following simple 
algorithm was suggested by Daskalakis, Mehta and Papadimitiou [DMP09]. Start by choosing an 
arbitrary pure strategy e$ for the row player, let ej be the column player's best response to ej, and 
let Ck be the row player's best response to ej. Then the following is a ^-equilibrium: let the column 
player play ej, and let the row player play e« with probability \ and et with probability | (neither 
player can gain more than | by deviating, since each player is playing a best response strategy with 
probability ^). Thus, every bimatrix game has a ^-approximate equilibrium in which one of the 
players plays a pure strategy. We show that this is also the case for optimal value ^-equilibria. 

Lemma 3.10. For every bimatrix game which has a Nash equilibrium of value v, there exists a 
^-approximate equilibrium with value at least v in which one of the players plays a pure strategy. 

Proof. Let M row and M co \ be the payoff matrices for the row and column players, respectively. Let 
(x*,y*) be a Nash equilibrium in this game of value v := vg(x* , y*), and let v r and v c be the payoff 
in this equilibrium to the row and column players, respectively (hence v = ^(v r + v c )). Without 
loss of generality, assume v r > v c (otherwise, a symmetric argument applies). 

Let the pure strategy e,- be some strategy in the support of y* for which the row player's payoff 
is at least v r (when the row player is playing x* and the column player ej). Such a strategy exists 
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since v r is the expected payoff for the row player when the column player plays according to y* . 
Furthermore, any such ej is a best response to x* since (x*,y*) is an equilibrium. 

Clearly, if the pair of strategies (x* , ej) is a ^-equilibrium we are done, since both the row and 
colum player are getting at least the same payoff as for the pair [x* ,y*). 

If (x*,ej) is not a ^-equilibrium, this must be because the row player has incentive > \ to 
deviate (as the column player is by definition playing a best response). Note that this implies that 
v c < v r < 7} since the row player's incentive to deviate can never be more than 1 — v r . Let e/% 
be some best response for the row player, and consider the pair of strategies (^(x* + ef.),ej). As 
above, this is a ^-equilibrium, since both players are playing a best response with probability ^. 
Furthermore, the payoff for the row player is at least v r /2 + {v r + \/2)/2 > v r + l/4 > v r + v c /2, and 
the payoff for the column player is at least v c /2. Thus the value is at least \{v r + v c /2 + v c /2) = v, 
and we are done. 

□ 

Now our algorithm follows immediately. 

Proof of Theorem \1.2\ Let M row and M co \ be the payoff matrices for the row and column matrices, 
respectively. By Lemma 13.101 there exists such an equilibrium in which one player plays a pure 
strategy. Suppose this is the column player (otherwise, a symmetric algorithm applies). Try all 
possible pure strategies ej for the column player. For each such strategy, solve the following linear 
program (if it is feasible): 

maximize ^x T (M c + M row ) ej 
subject to x 1 M co \eji < x 1 M co \ej + ^ 

(ei) T M row ej < x T M row ej + \ 

Xi > 

Y. Xi = 1 

i 

For at least one strategy ej, this LP is feasible and computes a ^-equilibrium with at least the 
desired value. □ 



4 Finding A Second Equilibrium 

In the following Theorem, c^tv refers to the total variation distance between two vectors, i.e., 
dTv(x,y) = \ \xi - y%\. 

Theorem 4.1 (Detailed Statement of Theorem 1 1.4p . There is a C > such that the following holds 
for all sufficiently small e > 0. Given a graph G we can in randomized polynomial time construct 
a bimatrix game Q' which admits a pure Nash equilibrium (ei,ej) such that, if G = G(n, |) with a 
hidden clique of size Clogn, the following holds (w.h.p. over G and Q' ): 

Completeness There is a Nash equilibrium (x,y) such that dTv( e i> x ) = dTv{ e j^y) = !• 

Soundness Given any e-approximate equilibrium (x,y) of Q' with dTv( e i: x ) ^ e + 0(e 2 ) or 
dTv{zj-,y) > e + 0{e 2 ), we can efficiently recover the hidden clique. 

Remark 4.2. Note that the bound e+0(e 2 ) on the statistical distance is almost tight: given any true 
equilibrium (x,y) there are e-approximate equilibria (x',y') with dTy(x,x') > e and dTy(y,y') > e. 



11 



Proof. Given G, first construct the game Q of Theorem 13.91 with parameter rj = 1/10, so that given 
a (4e)-approximate equilibrium of Q with value at least 1/10 we can reconstruct the hidden clique. 
Let (M row ,M co i) be the payoff matrices of Q and let N denote their dimension. Consider the new 
(N + l)-strategy game Q' = (M r ' ow , M^ ol ), with the following payoff matrices. 



ML 



A 



M col A 
1 



(4) 



where A is the 1 x JV vector with each coordinate equal to A. We set A = 8/10. Q' is a bimatrix 
game of size (N + 1) x (N + 1). 

Clearly, (ejv+i, ejv+i) is a pure Nash equilibrium of Q' . Note that for any mixed strategy x, we 
can write d,Tv(eN+i,x) = 1 — %n+i so it suffices to obtain good bounds on xn+i and jjn+i- 

Furthermore the completeness case also follows immediately since in that case Q has a Nash 
equilibrium with both players earning payoff 9/10. As A < 8/10 this is an equilibrium in Q' as well 
and since it does not use the (N + l)'st strategy we obtain the completeness property. 

For the soundness, consider any e-approximate equilibrium (x, y) of Q' , let p = [|a?nyi[|, q = 
||y[jv]|| be the probability that the row (resp. column) player plays in the original game Q where 
xn + \ < 1 — s — 0(e 2 ) and yjv+i < 1 — e — 0(e 2 ). We need to show that (x, y) can be used to recover 
the planted clique. 

Let (x,y) = {x\m/<p,y\m/q) denote the strategies conditioned on playing on the first iV strate- 
gies. We claim that (x,y) must be an e'-approximate equilibrium for the original game Q, with 
e' = To see this, suppose for contradiction that one of the players, say the row player, gains e' 
in Q by deviating to some strategy ey. Consider "lifting" this to a new strategy x' for Q' (i.e., in 
the strategy x' the row player plays e^+i with probability 1 — p, and with probability p). The 
change in payoff the row player obtains in Q 1 by switching from x to x' can be written as 



p{qs' + (i - q) ■ o), 



(5) 



where the qe' term is what the row player gains from when the column player plays on the first iV 
strategies, and the other term is since, when the column player plays on N + 1 the row player 
gets the same payoff on all the first N strategies. As (x,y) is an e-approximate equilibrium in Q' ', 
dm must be bounded by e and hence 

e' < - (6) 



£ 

pq 



Now the same argument as in the first part of the proof of Lemma 13.81 gives that 



e>pg(A-l)+p(l-g)(l-0) 
= p(l - llg/10) 
>p(l - llp/10). 



(row player's incentive to deviate) 
(by choice of A) 



This quadratic inequality in p implies that either p < e + 0(e 2 ) or llp/10 > 1 — t 
first possibility is ruled out by our assumption, therefore p > tj(1 — e — 0(e 2 )) 



-0(e 2 ). The 
and similarly 



q> ^{l-e-0{e 2 )). 

It is easy to see that the value of (x,y) must be at least 1/10 because (x,y) is an e-equilibrium 
for Q' . By the choice of parameters, if (x,y) is also a (4e)-approximate equilibrium for Q then we 
can reconstruct the hidden clique and we are done. But this follows easily from (jGj) . 

□ 
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5 Small support equilibria 



In this section, we show hardness of finding an e-approximate Nash equilibrium with small (logarith- 
mic) support when one exists, even for e close to i. Note that an e-approximate Nash equilibrium 
for two-player n'-strategy games with support at most 0(logn'/e) is guaranteed to exist by the 
algorithm of Lipton et al. [LMM03] . Here we consider approximate equilibria with smaller (but still 
logarithmic) support. Also, note that this is tight, since for e = i, we have the simple algorithm 
of [DMP09| . which gives a ^ equilibrium of support 3. 

Our reduction for small support equilibria involves the following construction, which is very 
similar to the earlier one. 



Reduction 5.1. Given a graph G = (V, E) with adjacency matrix A, construct 
a game Q = (M row , M CQ \) as follows. 



( aA 
B 



R 



\ 



/ 



M, 



col 



/ aA B 1 


V o 



J-R 



(7) 



/ 



where B is an Ni x n matrix whose entries are i.i.d. {0, 1} variables with ex- 
pectation f3. As in Reduction 13.21 N% = n c where c = (c2 + 1) log 1//3 and C2 is 
the constant from Lemma 13.11 

Each payoff matrix of ([7]) contains N2 identical copies of B, and R is an Aq x 
N1N2 uniformly random {0, 1} matrix. 



Theorem 5.2. For every r\ > there exists C > such that finding a 



rj)- equilibrium with 



support at most (logn)/2 is as hard as finding a hidden clique of size Clogn in G(n 



Proof. Given a graph G from G(n, |) (possibly with a hidden clique), construct the game Q as 



in Reduction 15.11 with the following parameters: let a = | +77/8, let /3 = a + (^ — rj) — r/ 2 /8 = 
1 — |r7 — r/ 2 /8. We choose the dimension A2 as N2 = n c ' , where d is chosen to satisfy (r/ 2 /8) 2 c' = 4c. 

0(l/r/ 3 ). We choose the density C of the hidden 



n + n c +c , so that we are looking for an 
Clogn for some c'/2 < C' < c'/2 + 1 



Since c = (C2 + 1) log 1//3 = Q(rj) we have that d 
clique to be C = d '/2 + 1. 

Note that the number of strategies is n + N1N2 
equilibrium with support at most | log(n + n c +c ) ; 
(assuming r/ is sufficiently small). 

The completeness follows easily. Suppose G contains a hidden clique of size C log n > C' log n. 
Then if both players play uniformly over the same subset of Clogn clique vertices, they both 
achieve reward a. The probability that, say, the row player can gain | — rj (i.e. get payoff a + ^—rj = 
(3 + r/ 2 /8) by deviating to some row in B (note that he can only deviate to rows in copies of B) is 
by the Chernoff bound Lemma 12. II and a union bound at most 



-OrVs) 2 ciogn 



n 



c-(n 2 /8) 2 C 



< n 



c-(« 2 /8)V/2 



n 



Now, for the soundness, consider any (^ — ^-equilibrium (x,y). Let us first show that both 
players must have most of their probability concentrated in the aA block. Let p = ||xr n ]|| and 
Q = l|y[n]|| (the probabilities that each player plays in the first n rows/columns). Let us consider 
the two player's incentive to deviate. In the aA block, the row player achieves at most payoff a, 
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and can achieve payoff f3 — o(l) by playing uniformly over all rows in B (w.h.p. this is true for 
all distributions over columns in [n]). In particular, there exists at least one row in B in which 
the row player can achieve this value. Now consider the right hand side of the payoff matrix. Let 
A 6 [0, 1] be the payoff that the row player receives in R (thus, the column player receives 1 — A 
here). For any row in B, there are N2 corresponding rows in R (one corresponding to each copy 
of B). Since the column player's support is at most C'logn, the probability that regardless of the 
column player's choice of support, there will be at least one row among these that has all l's in the 
corresponding positions is at least 

1 _ n C'logn^ _ g-C'logn^ > j _ / logn exp (_ 2 -C' lognjy^ 

— I _ n C'logn e -n- c '+ c ' 
> 1 _ n C>lo g n e -n«'/*-\ 

Thus, w.h.p. for every row in B and every possible strategy for the column player (up to the 
restriction on support size), there is a row in R corresponding to the correct row in B s.t. the row 
player would achieve payoff 1 in R (by deviating to this row). In particular, this is true for the 
row in B where the row player can achieve value f3 — o(l) (as before, we will ignore this o(l)). To 
summarize, by deviating, the row player can gain at least 

qp((3-a) + (l-q)(l-(l-p)\). 

Similarly, the column player's incentive to deviate is at least 

pq(j3 — a) + (1 — p)(l — (1 — q){\ — A)). 

On average, the two players' incentive to deviate is at least 

pq(J3 - a) + |((1 - p) + (1 - q)) - \{l - p)(l - q) = M (/3 - a) + ±(1 - pq), 

and since this incentive is at most 5 — 77, we have 

pq(P ~ a - \) + \ < \ - v, (8) 

or 

pq > -W- «)) = V/(V + W«) > 1 - rj/8. 

Now it remains to bound the conditional value w that is achieved in the aA block. This we can 
do using the same analysis as above, but substituting w for a. Making this substitution in dH) and 
solving for uu we have 

w > rj/pq + (3 — 5 = rj/pq + \ — 777/8 — r/ 2 /8 
> I + 77/8 - r] 2 /8 

= a(l - rf/{A + 7/)) > q(1 - rf/A). 

We are now in a position to apply Lemma 13.31 As stated Lemma 13.31 only applies to Reduc- 
tion 13.21 and not the present reduction but it can be verified that it works also in this case (what is 
needed is that the size of B is the same, and the presence of an approximate equilibrium with most 
mass in A and good value in A). We have t = 77/8 and s = r/ 2 /4 which is easily checked to satisfy 
the condition 1 — t — 3y/s/2 > a + (5 — 77). Thus we conclude the existence of a dense bipartite 
subgraph which, as before, allows us to reconstruct the hidden clique. 

□ 
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Note that we have a much smaller gap between the completeness and hardness above than 
in the other problems we have considered. In particular, we do not claim that finding a \ — rj- 
equilibrium with small support is hard even when an exact equilibrium with small support exists. 
However, modifying parameters in the above proof, such hardness can be shown for a smaller 
additive approximation: 

Theorem 5.3. For every r\ > there exists C > such that finding a (4 — if- equilibrium with 
support at most O(logra) in a two-player game which admits a pure Nash equilibrium is as hard as 
finding a hidden clique of size Clogn in G(n, |). 

6 Computing approximate pure Bayes-Nash equilibrium 

Bayesian games model the situation where the players' knowledge of the world is incomplete. In 
this paper we focus on Bayesian games with two players, but the results generalize to an arbitrary 
number of players. More details on Bayesian games can be found in most Game Theory textbooks, 
for example in |FT91| . 

In a Bayesian game the payoff of the players depends on the state of the world in addition 
to the players' strategies. In a situation with two players, the row player and the column player, 
each player is presented with a signal, called type, about the state of the world 6 row G 6 row and 
$coi G ©cob respectively. The types are distributed according to some joint distribution V and 
are not necessarily independent. The types determine the payoff matrices M row (# row , 9 co \) and 

M col (e 

row; $coi)- Denote the set of rows and columns in this matrix by S row and S co \, respectively. 
Each player chooses an action s row G <Srow an d s co \ G S co \ from their respective set of actions. The 
payoff function of the first player is thus u TOW (s row , s co i,6 TOW ,6 co i) = M row (0 row , col ) SroW)Scol G [0, 1]. 
The payoff function u co i is defined similarly. The payoff matrices, that depend on the players' types, 
as well as the distribution on types is known to the players ahead of the game. 

A pure strategy for the row player in a Bayesian game is a function (that by a slight abuse 
of notation) we denote by s row : ©row — > S row that for each type ^row as observed by row player 
associates a strategy s TOW (6 row ) that the player chooses to execute. A pure strategy s co i : © co i — > 
S row is defined similarly. 

Denote by 7\. ow the distribution on player column player's types 6 co \ conditioned on the type 
# row being observed. For a pair of pure strategies (s roW ) s coi) the payoff function of the row player 
is given by 

Prow(^row) — ^ [ttrow (Srow(^row) ) ^cdl (^col) > ^row> ^col)] • 

Scol~"Ps row 

A pure strategy Nash equilibrium in a Bayesian game, is a pair of functions s row , s co \ such that 
for all types observed, neither player has an incentive to deviate from his current strategy. In other 
words, for each 6 rovn and for each s' row G S row , 

Prow(^row) ^ ^ [^row(s row , Scol(^col) > ^rowi $col)] ) 

Scol^flrow 

and a similar condition holds for p co \. 

Since a pure Nash equilibrium need not exist in non-Bayesian games, it need not exist in 
Bayesian games either. Moreover, while verifying whether a non-Bayesian two player game has 
a pure Nash equilibrium is trivial, verifying whether a pure Bayesian Nash equilibrium exists is 
NP-hard |CS03] , Furthermore, as the example in [CS03] demonstrates, this problem remains hard 
even when the distribution on types is uniform and the payoff does not depend on the players' 
types. 
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A pure e-Bayesian Nash equilibrium (e-BNE) is defined similarly to an e-Nash equilibrium. For 
each observed type 6 row , the incentive to deviate should be bounded by e: 

Prow(^row) ^ [^row(^row> ^col(^col)) ^row; ^col)] 

Scol~7 , 9 row 

A similar requirement should hold for the column player. 

We show that for general distributions on types and for some small constant e, finding a pure 
e-BNE in games where a pure BNE exists is still NP-hard. On the other hand, we also show that 
if the distribution on the players' types is uniform, whenever a pure BNE exists, a pure e-BNE can 
be found in quasi-polynomial time. 

6.1 General distributions on types 

We show that for some constant e, determining whether a pure strategy e-Bayes Nash equilibrium 
exists is NP-hard. Specifically, e = 0.004 suffices. We prove: 

Theorem 6.1. Let e = 0.004. Then given a Bayesian game that admits a pure BNE, it is NP- 
hard to find a pure e-BNE for the game. Moreover, it is NP-hard to solve the promise problem of 
distinguishing games that admit a pure BNE from games that do not admit a pure e-BNE. 

Proof. We give a reduction from the problem of 3-coloring 4-regular graphs, which is known to be 
NP-complete [Dai80] . Let G = (V,E) be a 4-regular graph with \V\ = n. The edges of G can be 
properly colored with 5 colors using Vizing's algorithm. In other words, we can compute a coloring 
c : E — > {1, ... ,5} such that for every two edges e\, ei incident to the same vertex, c(e\) / c(e2). 
We now design a Bayesian game such that: 

• if G is 3-colorable, then the game admits a pure BNE; 

• if G is not 3-colorable, then the game admits no pure e-BNE. 

The type of each of the players corresponds to a vertex in the graph, thus row = ©col = V. 
The distribution on types is such that with probability 4/5, (61,62) is a random edge in E; with 
probability 1/5, {61,62) = (6,6) is the same random vertex in V. We note that in this example 
there is a high degree of correlation between the types of the two players. 

Each player has 6 strategies available to him: S TOW = Seal = {1)2,3} x {0,1}. The first 
coordinate of s row (v) should be thought of as the "color" assigned by player A to vertex v, while 
the second coordinate is either "head" or "tails", the use of which will be explained later. We 
denote the first coordinate by s r ow(^)i G {1, 2, 3} and the second by s TOW (v )% G {0, 1}. 

The payoff functions are defined as follows (s row and s co i stand for s TOW (9 row ) and s co \(6 co i), 
respectively) : 



6row — ^col 


? 

(Srow)l = (Scol)l 


? 

(Srow)2 = (Scol)2 


u row 


Mcol 


yes 


yes 




1 


1 


yes 


no 


yes 


0.64 





yes 


no 


no 





0.64 


no 


no 




1 


1 


no 


yes 


yes 


0.01 x 2 c ^ row,&co1 ^ 





no 


yes 


no 





0.01 x 2 c ^ row ' yco1 ^ 



In other words, the first coordinate of the strategy represents vertex color. The players are 
rewarded for producing a consistent 3-coloring: for the same vertex 6 the colors (s TOW (9))i and 
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(scoi(#))i should match; for different vertexes connected by an edge the colors (s r ow(#row))i and 
(s co i(# co i))i should differ. If these conditions are satisfied, both players are rewarded with a payoff 
of 1. If a certain edge or vertex fails to satisfy the conditions, the games becomes a zero-sum games 
that depends on the second coordinate of the players' strategies. The payoff can take values of 0.02, 
0.04, 0.08, 0.16, 0.32 for edge pairs (0row,#coi) an d the value of 0.64 for vertex pairs (9,9). What 
makes these values interesting is that no combination of these values with coefficients of —1,0, +1 
adds up to less than 0.02. 

If the graph G admits a 3-coloring C : V — > {1, 2, 3}, then the pure strategy s TOW (v) = s co \(v) = 
(C(v),0) yields the optimal possible payoff of 1 for both players, and thus is a pure BNE. 

On the other hand, suppose that G is not 3-colorable. Let s row (v), s co \(v) be a set of pure 
strategies that is an e-BNE. Denote the expected payoff functions under these strategies by u row (v) 
and u co \(v). If the row player deviates in her strategy from (s row (v)i, s TO - w (v)2) to (s row (v)i,l — 
Sraw(v)2) her payoff only changes for type pairs where the conditions are not satisfied. Let us denote 
this change by A TOW (v). The contribution of each edge (t> TO w 5 ^coi) to A row (?; row ) and A co i(u co i) 
cancels out, and hence, summing over all edges, 

(A IOW (v) + A col (v)) =0. (9) 

vev 

By the way we designed our payoffs, the values of A row (t>) and A co \(v) are integer multiples of 
0.004. Moreover, if at least one edge adjacent to v is not satisfied, then A row (t;) cannot be equal 
to 0. Since G is not 3-colorable, it means that A row (v) ^ for at least one of the vertexes. By ([9]) 
this means that A row (t;) > or A co1 (d) > and thus A row (i;) > 0.004 or A col (v) > 0.004. This 
contradicts the assumption that (s row ,s co \) is an e-BNE. □ 

6.2 Uniform distribution on types 

In this section we show that in the case where the distribution on types is uniform, a pure e-BNE can 
be computed in quasi-polynomial time. This contrasts with the previously noted fact from [CS03J 
that computing a pure BNE is NP-hard even in this special case. As for other quasi-polynimial 
time computable approximate equilibria we've considered, whose exact variants are NP-hard, this 
problem is also as hard as Hidden Clique: 

Theorem 6.2. For every r\ > ; finding a (| — rj)- approximate pure BNE in a two-player Bayesian 
games with uniformly distributed types and in which a pure BNE exists is as hard as finding a 
hidden clique of size Clogn. 

Proof. This follows immediately from Theorem 15. 31 Let M row and M co \ be as in Reduction 15.11 with 
parameters set in order to achieve the j — 77-hardness of Theorem 15.31 Consider the Bayesian game 
in which both players have exactly C' log n types, 

the distribution over types is uniform, and the payoff matrices for each player are always the 
same M row and M co \ as above. Note that when there is a hidden clique, the following is an pure 
BNE: choose a subset S of C' log n clique vertices, and let each player play according to a one-to-one 
mapping from their type set to 5. On the other hand, it is easy to see that every pair of pure 
strategies for the Bayesian game corresponds to mixed strategies for the original game, where the 
incentive to deviate for, say, the row player, is the expected incentive to deviate in the Bayesian 
game (over all choices of types for the row player). In particular, if a player always has incentive 
at most j — T] to deviate in the Bayesian game, their incentive will be at most — n in the original 
game, and so by Theorem 15.31 we can recover the hidden clique. □ 
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We also note that our result generalizes to the case when the distribution on types is a product 
distribution, i.e. when 6 Taw is independent from 6 co \. To simplify the presentation we further assume 
that the type space is of equal size for both players, i.e. |0 rO w| = | ©col I = k. We prove: 

Theorem 6.3. In a two-player Bayesian game, suppose that the types are distributed uniformly on 
the space 6 row x co i, and that |0 row | = I ©col I = k, and \S row \ = \S co \\ = n. Assuming that a pure 
BNE exists, we can find a pure e-BNE in time n°^ logn+logfc ^ e ). 

Remark 6.4. The assumption in Theorem 16.31 can be relaxed to a pure (e/2)-BNE equilibrium 
existing (instead of an actual equilibrium). 

Proof. The proof is similar in spirit to the quasi-polynomial e-Nash algorithm of [LMM03] , but some 
additional work is needed. We first (approximately) guess the payoffs and the allowed strategies for 
both players for all possible types. We then use linear programming to produce a (not necessarily 
pure) (3e/4)-BNE. We then sample from this approximate non-pure BNE to obtain a pure e-BNE. 

For simplicity, denote 6 r0 w = ©col = {!)•••>&} and 5 row = S'coi = {1, • • • , n }- Let s row : @ row — > 
Stow, s co \ : @ co \ — > S co \ be a pair of pure equilibrium strategies (that we assume exist). Let 
and be the corresponding payoff values. In other words, pl° w is the payoff the row player gets 
when his type is i E {l,...,k} and he plays strategy j G {1, . . . , n}. The equilibrium assumption 
is that for all i, 

row row 

and a similar condition holds for p co1 . We first claim that we can recover all the values of p row and 
p co1 within an error of e/4 with probability > n~°^ log ^ nk ^^ £ ). 

Claim 6.5. There is a polynomial time algorithm that with probability > n - °(( log ( nfc ))/ e2 ) outputs 
values q*° w , q?f such that for all i,j, \q™ w - p™ w \ < e/8 and \q-f - p^f\ < e/8. 

Proof. We show how to approximate p row well with probability > n~°^°^ nk ^/ e2 \ The claim 
follows by approximating p row and p co[ independently. Pick a subset of m = [~(401og(n/c))/e 2 ] 
of the column player's types t±, . . . , t m G {1, . . . , k}. For each type t r guess the column player's 
strategy s r on this type. With probability n~ m we correctly guess all strategies, i.e. s r = s co \(t r ) 
for all r. Set 

1 m 



row 
Hj 



m 
r=l 



In other words, we calculate an estimate of the expected payoff p vow using only the value at the 
types we've guessed. By Hoeffding's Inequality, for each (i,j) the probability 

P[|^° w ~pT\ ^ £ / 8 l ^ 2 • exp(-me 2 /32) < l/(4nk). 
The claim follows by union bound. □ 

Allowing for a blow-up of n !! 10 ?^))/ 62 ) \ n running time, we may assume from now on that the 
correct values of q row and g co1 had been computed (up to an error of e/8), since in the end we can 
check whether the set of pure strategies obtained is a e-BNE. 

Next, we formulate a linear program that obtains a (3e/4)-BNE with payoff values close to 
qtow qcdl 'p^g var i a bles of the program are Xj° w and , where corresponds to the proba- 
bility that the row player plays strategy j on type i. For each type i let 

Mf w - max?™ 

3 
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be the maximum possible payoff the row player can attain on type i under payoffs <jf° w . We only 
allow X™ w to be non-zero when the corresponding payoff qj° w > MJ ow — e/8. This guarantees that 
the solution we obtain is a (3e/4)-BNE. In addition, we enforce the payoffs to be close to qlj W - For 
each i,j we have the constraint 



k 

y,z 



<e/4, 



This linear program is feasible since the pure BNE we 
assumed exists is a solution to it. Denote the resulting expected payoffs by i>™ w and v™ 1 . 

The solution thus obtained is a (3e/4)-BNE. To see this, we actually observe that the equilibrium 
is well supported: for each type i, each strategy j with Xf?™ / the payoff v\J" > Mf ow — 3e/8 > 
maxj, vl°r - 5e/8. 

To complete the proof we now obtain a set of pure strategies s' row : @ row ~~ > S row , s' col : 
©col — > S co \ by sampling j = s' row (i) according to the probability distribution X™ w . Assuming 
k = f2((log(n/c))/e 2 ), once again by Hoeffding's Inequality the payoffs will be e/8-close to the 
payoffs v™ w , and thus the resulting game will be in an e-BNE. 

In the case when k = 0((log(n/c))/e 2 ), i.e. the number of types is small, we can find the exact 
BNE by brute force, completing the proof of the theorem. □ 
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